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Abstract 

A Boolean function / : {0,1}™ —> {0,1} is said to be noise sensitive if inserting a 
small random error in its argument makes the value of the function almost unpredictable. 
Benjamini, Kalai and Schramm BKS99 showed that if the sum of squares of influences in / 
is close to zero then / must be noise sensitive. We show a quantitative version of this result 
which does not depend on n, and prove that it is tight for certain parameters. Our results 
hold also for a general product measure u p on the discrete cube, as long as log 1/p <C logn. 
We note that in [BKS99 , a quantitative relation between the sum of squares of the influences 
and the noise sensitivity was also shown, but only when the sum of squares is bounded by 
nT c for a constant c. 

Our results require a generalization of a lemma of Talagrand on the Fourier coefficients 
of monotone Boolean functions. In order to achieve it, we present a considerably shorter 
proof of Talagrand's lemma, which easily generalizes in various directions, including non- 
monotone functions. 

1 Introduction 

The noise sensitivity of a function / : {0, l} n — > {0, 1} is a measure of how likely its value is to 
change, when evaluated on a slightly perturbed input. Noise sensitivity became an important 
concept in various areas of research in recent years, with applications in percolation theory, 
complexity theory, and learning theory (see e.g. [BKS99 , [HasOl], [BJT99]). We work with a 
dual notion of noise sensitivity, namely noise stability, defined as follows. 

Definition 1. For x G {0, l} n , the e-noise perturbation of x, denoted by N e {x), is a distribution 
obtained from x by independently keeping each coordinate of x unchanged with probability 1 — e, 
and replacing it by a random value with probability e. For this purpose we assume that a product 
distribution [i on the discrete cube {0, l} n is defined, however we leave it implicit in the notation. 

The noise stability of f is defined by 

5 e (/) d =C0V^ i y ~ Ne{x) [f(x),f(y)}. 

Roughly saying, a function is noise-sensitive if its noise stability is close to zero. 

Another concept that was intensively studied in recent decades is that of the influences of 
coordinates on a function. 
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Definition 2. Let f : {0,1}"" — > {0,1} be a Boolean function, and let i G {l,...,n}. The 

influence of the i 'th coordinate on f is defined as 

I l {f) = Vv x ^[f{x)^f{x@e i )}, 
where the vector obtained from x by flipping the i 'th coordinate. 

Influences were studied in economics for decades, and first found their way into computer 
science in |BOL90| . in the context of cryptography. The study of influences has numerous 
applications in mathematical physics, economics, and various areas of computer science, such as 
cryptography, hardness of approximation, and computational lower-bounds (see e.g. [LMN93J, 
[D505| . |Mos09j . or the survey [KS06 ]). 

Relations between influences and noise sensitivity. The noise sensitivity of a function 
and its influences both measure how likely it is to change its value when the input is slightly 
perturbed. It makes sense to study the relations between these concepts. Perhaps counterintu- 
itively, it turns out that functions with very low influences must be very sensitive to noise. This 
phenomenon was first shown in a paper by Benjamini, Kalai, and Schramm [BKS99]. They 
proved the following theorem: 

Theorem 3 ( [BKS99] ). Let {f m : {0, l}" m -> {0, l}} m=lj2) be a sequence of Boolean func- 
tions, such that 

J>(/) 2 ^>o. 

1=1 

Then for any e > 0, S t (f m ) m ^°°) o. 

The BKS theorem was proved in |BKS99j only with respect to the uniform measure on 
the Boolean cube, and the case of highly biased product measures was left open. However, 
for some applications, such as in the study of threshold phenomena, one is often interested in 
biased measures. Moreover, the BKS theorem is qualitative, and does now show a concrete 
relation between the influences of a function and its noise stability (a quantitative relation was 
shown in [BKS99], but only for the case where for a function / : {0, l} n — > {0, 1}, the sum of 
squares of the influences is inverse polynomially small in n). 

1.1 Our results 

In this paper we show a quantitative version of the BKS theorem. With respect to the uniform 
measure, we prove that 

Theorem 4. There exists a constant C > .234 such that the following holds. Let f : {0, l} n — > 
{0, 1}, and denote 

1 n 

M/) = ^E^(/) 2 - 

i=l 

Then 

S e {f) < 20 • 
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The main technical tool used in the proof of Theorem U and also in the original qualitative 
result of [BKS99], is a generalization of a lemma by Talagrand [Tal96]. Talagrand's result 
considers the Fourier- Walsh expansion of a monotone Boolean function, and bounds its weight 
on second- level Fourier coefficients in terms of its weight on first level coefficients. This lemma 
is of independent interest, and was used by Talagrand to estimate the correlation between 
monotone families |Tal96j . and the size of the boundary of subsets of the discrete cube |Tal97 . 
The generalization gives a similar bound on the weight on d-level coefficients. 

While in the paper of [BKS99J only a qualitative estimate was given in the generalization 
of Talagrand's lemma, the main technical tool in our proof is a quantitative version of it. To 
obtain it, we simplify the proof of Talagrand's lemma in a way which makes its generalization 
quite simple and straightforward. Our result for the uniform measure is the following: 

Lemma 5. For all d > 2, and for every function f : {0, l} n —> {0, 1} such that 

W(/)<exp(-2(d-l)) (1) 
(where W(/) is as in Theorem^), we have 

E f^ 2 * t • f^r)^ 1 • W) ■ ( log WW)))"" 1 - ( 2 ) 



Lemma O as well as Theorem 01 hold also for functions into the segment [—1,1], if one 
appropriately extends the definition of the influence. Specifically, one should define Ii(f) = 
\\f(x li ~°) — /(x 1- * -1 )]^ where x % ^ a is the vector obtained from x by inserting a in the i'th 
coordinate. This holds also for the biased case, discussed below. For simplicity, we assume that 
/ is Boolean in the proofs. 

We note that Talagrand also proves a "decoupled" version of his lemma. While we do not 
need a decoupled version for the proof of Theorem HI we prove one for the sake of completeness 
in the Appendix. 



Biased measure. In the study of threshold phenomena, and for other applications, often 
one is interested in biased measures rather than the uniform measure over the discrete cube. 
Once the proof of Talagrand's lemma is simplified, it becomes easier to apply it also for biased 
measures. Below are our analogous results with respect to the p-biased measure on the discrete 
cube. The coefficients below are with respect to the "p-biased" Fourier- Walsh expansion (see 
Section [2]). 

Lemma 6. Let f : {0, 1}™ — > {0, 1} ; and denote 



W{f)=p{l- P )-Y,h{f? 



For all d > 2, if 
then we have 



W(/) <exp(-2(d-l)), (3) 
where B(p) is the hypercontractivity constant defined in Section^ 
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We note that a bound slightly weaker than in Lemma [6] can be obtained from Lemma [5] by 
a general reduction technique, as was observed in [KellO| . However we prove Lemma [6] directly, 
and Lemma follows as an immediate corollary. 

Using Lemma [6] we can prove an analogue of Theorem H] for the case of biased measure. 
Theorem 7. For all d > 2, and for every function f : {0, l} n — > {0, 1} the following holds. 
Denoting W(f) = p(l — p) • Y17=l ^*(/) 2 > we have 

S e (/)<(6e + l)W(/) a(e) ' e , (5) 

where 

a{C> e + log(25(p)e) + 3 log Iog(2B(p)e) ' 
and B(p) is the hypercontractivity constant defined in Section^ 

We note that for small p, B(p) pi og (i/p) > an< ^ thus Theorem [7] is useful only when log(l/p) 
is assymptotically smaller than log(ra). Indeed, when p is inverse polynomially small in n the 
BKS theorem does not hold even qualitatively - there exist functions which have assymptotically 
small influences but are noise stable. The graph property of containing a triangle with respect 
to the critical probability p is an example of such a function. 

Tightness. Our main result (Theorem [7]) is tight for small p up to a constant factor in the 
exponent of W(/), which tends to 1 for small e, and for p = 1/2 it is tight up to a constant 
factor in the exponent. In Section [5] we prove this, and also discuss the tightness of Lemma [6j 
showing that it is essentially tight. 

Organization. This paper is organized as follows: in Section [2] we recall the definitions of the 
biased Fourier- Walsh expansion, hypercontractivity estimates, and some related large deviation 
bounds. In Section [3] we present the proof of Lemma [6] (which immediately implies Lemma [5] 
as well). In Section [4] we show that Lemma [6] implies Theorem [7J (and also Theorem |4|). In 
Section [5] we discuss the tightness of our results. Finally, in the Appendix we prove a decoupled 
version of Lemma El 

2 Preliminaries 

2.1 Biased Fourier- Walsh Expansion of Functions on the Discrete Cube 

Throughout the paper we consider the discrete cube = {0, l} n , endowed with a probability 
product measure fj, = /jl p (8) • • • <8> fJ- p , i.e., 

n 

p{ X ) = fJ,(^X 1 ,...,X r Sj = Y[p Xl (l -p) l ' X \ 

i=l 

Elements of Vt are represented either by binary vectors of length n, or by subsets of {1, 2, . . . , n}. 
Denote the set of all real- valued functions on the discrete cube by Y. The inner product of 
functions /, g S Y is defined as usual as 

(f,g)=E[fg}= rtx)f(*)g(x). 

x<={0,l} n 

This inner product induces a norm on Y: 

\\f\\2 = vuj) = vnp}- 
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Walsh Products. Consider the functions {wj}™ =1 , defined as: 




As was observed in [Tal94j , these functions constitute an orthonormal system in Y (with respect 
to the measure n). Moreover, this system can be completed to an orthonormal basis in Y by 
defining 

for all T C {1, . . . ,n}. The functions ut are called (biased) Walsh products. 



Fourier- Walsh expansion. Every function / 6 Y can be represented by its Fourier- Walsh 
expansion with respect to the system {wr}rc{i,...,«} : 

Tc{l,...,n} 

The coefficients in this expansion are denoted 

f(T) = (f,uj T ). 

A coefficient f(T) is called k-th level coefficient if \T\ = k. By the Parseval identity, for all 
/ £ Y we have 

E /W = 11/112- 

TC{l,...,n} 



Relation between Fourier- Walsh expansion, noise stability, and influences The noise 
stability of a Boolean function can be expressed in a convenient way in terms of the Fourier- 
Walsh expansion of the function. 

Claim 8. For any function f : {0, l} n — > {0, 1} and for any e > 0, we have 

5 e (/) = E( 1 - e ) |S| /W- 

The assertion is obtained by direct computation in the case where / is a linear character, 
and it follows for general characters by multiplicativity of expectation for independent random 
variables. It then follows for the general case by linearity of expectation. 

The influences are also related to the Fourier- Walsh expansion. It can be easily shown that 
p(l — p) ■ Yl7=i^i(f) = Yls \S\f(S) 2 - Moreover, the influences are specifically related to the 
first-level Fourier coefficients. Indeed, denoting by X-i G {0, l}[ n l\w the vector obtained from 
x by omitting the z'th coordinate, we have (for any Boolean function / and for any 1 < i < n): 

\f({i})\ = \E x [u,i(x)f(x)]\ < E a; _. e{0il} w\ { i} [|E^ e{0>1} Mx)/(x)]|] = VP(1-P)W), 

and thus, 

n 

E/(W) 2 <W). (6) 

i=l 

These expressions of noise stability and influences in terms of the Fourier- Walsh expansion 
play an important role in our proof. 
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2.2 Sharp Bound on Large Deviations Using the Hypercontractive Inequal- 
ity 

A crucial component in the proof of Lemma is a bound on the large deviations of low-degree 
multivariate polynomials. Formally, for any d > 1, we would like to bound the probability 
P r [|/| > t], for every function / whose Fourier degree is at most d. In the uniform measure 
case, such bound was obtained in [BKS99] using the Bonami-Beckner hypercontractive inequal- 
ity [Bon70t lBec75j . In the biased case, one should use a biased version of the Bonami-Beckner 
inequality instead, and the strength of the obtained bound depends on the hypercontractivity 
constant, which depends on the bias. The optimal value of the hypercontractivity constant for 
biased measures was obtained by Oleszkiewicz in 2003 [Qle03| . For ease of presentation, we cite 
a large deviation bound, presented in [DFK()07pl . that relies on a slightly weaker estimate of 
the hypercontractivity constant. 



Definition 9. For all < p < 1, let 

B(p) 



v 



V 

1-p 



(l-p)-p 



2 In i=£ 2p(l -p)(ln(l - p) -hip)' 



(7) 



Theorem 10 (Lemma 2.2 in |DFK()07^ . Let f : {0,1}™ - 
and assume that \\fW2 = 1- Then for any t > (2B(p)e) d / 2 , 



have Fourier degree at most d, 



Pr[|/| >t] <exp 



d 



2B{p)e 



t 2 / d ). 



(8) 



The next lemma, which easily follows from Fubini's theorem, allows using large deviation 
bounds to evaluate certain expectations. The integral that we get when we later apply it, using 
the bounds in Theorem 1101 is considered in Lemma [T2l . 

Lemma 11. Let f2 be a probability space, and let f,g : O, — >• R be functions, where g is non- 
negative. For any real number t, let L{t) C O, be defined by L{t) = {x : g(x) > t}, and let ^-L(t) 
be the indicator of the set L(t). Then we have 



E xen [f(x)g(x)] 



t=o 



(E« en [/(a 



[x)])dt 



Proof: 











^xeu[f(x)g(x)] = E xe n 


f{x) ■ ldt 




/ f(x 




Jt=o 




Jt=0 



(x) dt 



POO 

/ (E xeQ [f(x)-l m (x)]) dt, 
Jt=o 



where the last equality follows from Fubini's theorem. □ 



Lemma 12. Let d > 2 be a positive integer, and let to be such that to > (AB(p) ■ e)^ d 1 ^ 2 . 
Then 



exp 



(d-1) 
2B(p)-e "° 



2/(d-l) 



• (9) 



1 In fact there is a small typo in the formula for B(p) in DFK O07| . which is fixed here. 
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Proof: To bound the l.h.s. of we first apply a change of variables, setting 



2B(p)-e ' 1 j 



and obtaining 



d-l 



2B{p)-e\^ d - l)/2 d-l 



(11) 



s (3^5)/2. exp( _ s) dg 



s=s 



where so = 2B(p)-e ' (^o) 2 ^ d ^ ■ If we denote the integrand on the r.h.s. of (fTTj) by (p(s), one 
notes that for s > sq, tp(s) is decreasing and <p(s + l)/(p(s) < exp(— 1/4) - this follows from 
the condition on to and from ([lUp , It therefore follows that the integral on the r.h.s. of (j lip is 
bounded by (so)^ 3m_5 ^ 2 • exp(-s ) ■ i^^rzim - 5(s ) (3m ~ 5 ^ 2 • exp(-s ). Substituting into 
(|10p gives the lemma. □ 



3 Proof of Lemma [6] 

Notation 1. Throughout the proof, we use a "normalized" variant of the influences: 

i'i(f) = Vv^p)h{f). 

This notation is only technical, and is intended to avoid carrying the factor y/p(l — p) along 
the proof. Note that W{f) = X^=i W) 2 - 

3.1 Two key observations 

The key to the proof of Lemma is based on two observations, as was the proof in |Tal97| . 

First observation. We write the space {0, l} n as a product of two probability spaces {0, 1}^ 
and {0, 1} J . We consider for every j £ J, the part of the Fourier- Walsh expansion of /, which 
consists of Walsh products whose sole representative in J is j. 

We now note that it is sufficient to prove that for every partition {J, J} of {1, . . . , n}, 

\T\=d-l V ' 

The assertion of Lemma[6]will follow from (1121) by taking expectation over the partitions {I, J}, 
such that every coordinate is independently put into J with probability 1/d. We give the exact 
details at the end of the proof. 
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Second observation. The second observation is that we can write the left-hand-side of (|12p 
as the inner-product of / with a function of the form Yl f'j^j ; where the functions | /j | are all 
of low degree, and depend only on coordinates from I. The low degree of the /j's will allow us 
to use Theorem 1101 to bound them. 

For a given partition {/, J} of {1, ... , n} and an index j G J, let 

/;= E f(T,j)u T . 

Td,\T\=m-l 

Note that /j indeed depends only on coordinates from I. We have 
and summing over j we have 



£ f(T,j)u; TU{j} ,f\= J2 f(i T 'j}) 2 > 

Td,\T\=d-l I TCl,\T\=d-l 



(13) 



(14) 



E E^ r ^) 2= Lh - s - of » 

\ j / rc/,|T|=d-iieJ 

It will be convenient for us to normalize /j, hence we take fj = /j/H/jl^- It follows from ([13 
that for every j £ J, 

1/2 



E /({ T >i}) 2 

vTc/,|T|=d-l 



(15) 



3.2 Proof of (HI). 

Using (fT5|) and the fact that /j only depends on coordinates from /, we have for every j £ J 
that 



TcI,\T\=d-l 



E 



x£{0,l}' 



fj( x ) ■^ y e{Q,i} j \ u] ^ x ^y)f i y x ^y)\ ) 

\fj( X )\ ■ ^{O,!}- 7 ^'^^)/^^)] 



(16) 



(17) 



We now use Lemma [TT] to bound (|17p by considering the two multiplicands in the expectations 
as functions over {0, 1}^, and obtain 



(HID < 



E 



i=0 



x€{0,l}' 



L {|/iWI>*} ' 



E 



dt 



Using the inequality (a + b) 2 < 2a 2 + 2b 2 we thus have that for any parameter to, (fT7|) is bounded 
above by 

2 









1 {i/iWi>*} • 



E 



t=t 



xGfO,!} 7 



4l/iWI>*} 



E 1 /e{o,i} J [ fa, i( x »v)/( af >l')] 

E j/e{o,i} j N( x 'y)/( x 'y)] 



dt 



(18) 
(19) 



We will bound separately each of these summands. 
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Bounding (I18p . For z £ {0, l}^ n ', we denote by G {0, th e vector obtained from 

z by omitting the j'th coordinate. Since an indicator function is bounded by 1, we have 



(USD < 2 / E 



to 



t=o 



<2tn- E 



"xe{o,i} J 



E 2/e{0 ,i} J K(x,y)/(a;,2/)] 



^ 2t - ( E xe{0,i} /E y'e{0,i}- A{j} E %e{o.i) [ u iM > %)/0> 2/'. Vi)] ) 



2t o • ( E ^.e{o,i} [ " lx « } 



E ^e{o,i}K(z)/(X)] 



2tg-/'(/) 2 . 



Bounding (I19p . In the computation below, we explain some transitions below the corre- 
sponding line. We note that the two last implications apply if to > (4.B(p)e) 
indeed the case for the to that is chosen later. 



(d-l)/2 



which is 



f CO ^ 

; 1.9) - 2! / - • i • E 

't=to * 

("CO ^ 



xe{o,i} J 



L {|/i(^)l><} 



E 



< 2 



t=to 



dt 



r ■ E 



*=*0 



L {|/iMI>*} 



E y 6 {0,l} J [ £,; i( aJ >f)/( a: >I')] 



(by Cauchy-Schwarz) 



2 /"°° 



(by applying Cauchy-Schwarz on the space {0, l} 7 ) 

2" 



dt 



< — ■ E 

" t 



xe{o,i} J 



^ y&{0A y[uj(x,y)f(x,y)} 



i (d-1) 
t 2 ■ exp ' 



t=t 



2B(p)e 



t 2 /^- 1 )) dt 



(by pulling the expectation outside of the integral, as it does not depend on t, and bounding 
the deviation of fj using Theorem [10] ) 



< We ■ t'^ 1 ■ B(v) • exn ( ^"^ • t 2 '^ 
<10e t t>{p) exp^ 2B{p y e ir 



E xe{o,iy 



E 



ye{o,iyl u} 3( x >y)f( x >y)} 



(using Lemma fT2j) . 



Proving inequality (1121) . Since the sum of (|18p and (|19|) bounds the l.h.s. of (|16p . we have 
from the above bounds that 



£ /(T,j) 2 < 2t 2 -l'(f) 2 + 
Tcl,\T\=d-l 

2 



2 

+10e • t~ Q " d_1 



• exp 



2B(p)-e i -e{o,i}' 



E 2/g{01} j[w j (x,y)/(x,y)]^ 

(20) 
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We now use the following observation: for any fixed x G {0, l} 7 , let f x : {0, 1} J — > {0,1} be 
denned by f x (y) = f(x,y). Then 

(E yem}J [u; :i (x,y)f(x,y)]) 2 = f x ({j}) 2 . 

Since H/rlh < 1 ; by Parseval's identity we have that for every x 6 {0, l} 7 , 



5^( E „e{o I i} jr [ fa, i( a: >l')/( a: »I')]) < 1- 
By summing (|20p over j £ J and substituting (|21[) inside the expectation, we obtain that 

. / (d-1) t 2/(d-l) 



(21) 



E E /(^j) 2 <2^-EW) 2 + 10e -*o ^-B^-exp 



jeJ tci 

|T|=d-l 



We now choose to so that 



25 (p) • e 



o 



(22) 



exp 



2/(d-i) 
2B(p)-e' 



EW) 2 - 



A simple computation shows that 



(to) 



2 (2B{p)-e 



d-1 



d-1 



d-1 



log 



(23) 



and by assumption ([3]), we have that (to) 2 ^ d 1 ) > 4B(p)-e, satisfying the requirement mentioned 
in the bound on (fT9|) . We now substitute to m (1220. obtaining the bound 



E E fei) 2 <5.(^ 

|T|=<f-l 
thus proving (fT2|) . 



ci-l 



d-l 



£W 5 



log 



E^(/) 2 



> (24) 



Completing the proof of Lemma [6l Let us choose J C {1, . . . , n} to be a random subset, 
independently containing each coordinate with probability 1/d, and let I = [n] \ J. For each 
subset S C {1, . . . , ra} of size d, the probability that it can be represented as a pair (T,j) where 
TCI and j £ J, in which case f(S) 2 is included as a summand in the left-hand-side of (|12p . 
is ((d- l)/d) d_1 > 1/e. Hence 



E /» 2 <e-E. 



\S\=d 



E E 

jeJ tci 

\T\=d-l 



(25) 



We wish to apply a similar argument to the r.h.s. of (|12p . using the fact that each coordinate in 
{1, . . . ,n} appears in the sum with probability 1/d. We observe that the function a;log(l/a;) 1 



10 



is concave in the segment [0,exp(— (d — 1))], and by assumption ([3]) the sum Yljej Ij(f) 2 * s m 
this range for any J C {1, . . . , n}. We thus have 



E, 



(^r-MHs=w)) 



d-l' 



5 (2B(p)-e 



d V rf -l 
5 (2B(p)-e 



d-l 



d 



d-l 



E W) a ■ K rma 

,e{l,...,n} / V \Ue{l,...,n}W) 

d-l / / , \ \ d-l 



(26) 



• lo 



d 



d v d-i y " w/ v~""°VW) 

The combination of (|25p and ()26p completes the proof. 



4 Proof of Theorem [7] 

Let / : {0, l} n {0, 1} be a function, and let W = W(f) = p{l - p) • £™ =1 *j(/) 2 - 0ur S oal 
to show that 



is 



Se(f) < (6e + 1)W Q£ , where a 



e + log(2B(p)e) + 31oglog(2S(p)e) ' 



Recall that by Claim [5J we have 



5 e (/) = ^(l-6)l 5 l/(5) 2 . 

S#0 



For some L that we choose later, we write 



0<|S|<L |5|>L 



and bound each of the terms separately. 



(27) 



(28) 



(29) 



Bounding the high degrees term. The second term in f|29|) is dominated by the powers 
of (1 — e). Since \ \f\\\ < 1, we have from Parseval's identity that 



J2 (1 - e)^f(S) 2 < (1 - e) L+1 • E /»* ^ C 1 - £ ) L+1 - 

|S|>L |5|>L 



(30) 



Bounding the low degrees term. Here we neglect the powers of (1 — e) and use Lemma [6] 
to bound the Fourier coefficients of degree d for each 1 < d < L (for d = 1 we use Equation ©). 



0<|5|<L d=l \S\=d d=2 V 7 V 



d-l 



(31) 
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Assume that L < B(p) log(l/W). In that case for any d, 1 < d < L, the ratio between the 
(d + 1) term and the d term in (|3ip is bounded from below by 2. Indeed, this ratio is 



d 



2B(p) ■ e 



d+l \ d 

2B(p) ■ e 1 



rZ-lV- 1 1 flo R {(d+l)/W) \ d - 1 



d \ log((d)/W) J 
2 



log 



d+l 
W 



- - , . d+l\ 
d+l "e" ' ° g V^T) ~ log(l/W) 



, d+l\ 



We can thus replace the sum in (|3ip by twice its last term, thereby getting 



0<|S|<L 



L — 1 



L-l 



W-(log(4) 



L-l 



(32) 



Choosing the value of L. We choose the value of L such that the bound on S e (f) obtained 
from (1291). (l30l) and (1321) is minimized. We thus take 



L = a • log(l/W), where a 



1 



e + log(2B(p)e) + 3Ioglog(2B(p)e) 
Theorem [7] is obtained immediately from the following claim. 
Claim 13. For a and L as chosen in it holds that 

and 

giB) < 6e • W a ' £ . 
Proof: The first inequality follows since 

(EQD < (1 - e) L < exp(-e • a ■ log(l/W)) = W a e . 
To bound (1321) . we first note that 



L 
10e 2 



a 



L-l 



1/ ' I L — 1 



L-l 



w- 1 log (A) 



L-l 



1 



L \log(l/W) 



11 m*h 



w 



2B(p) ■ e 



a 



L-i 



10e 2 
~JT 

10e 2 
L 



1 + 



1 + 



lo: 



L-l 



io g (i/w)7 

log(L) 



2B(p) • e 



a 



L-l 



iog(i/w); 



2B(p) • e 



a 



L-l 



<W + We 2 



2B(p) ■ e 



a 



L-l 



where the inequality before last holds because a < 1 (since B(p) > 1 for all p). 



(33) 
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Since W < W a ' e , to finish the claim it remains to prove that 



Note that 



a ) 2e 



>v . 2B(j>) ■ e \ L ^ 1 ^ ^25(p) • e^ 



a / 25(p) • e \ a 



1 _ a .l os: f 2a (ri"= 



25 (p) • e 
_ 2e 

Hence it is sufficient to show that the exponent above is higher than a ■ e. Since 

. ,'2B(p)-e\ e + 31oglog(25(p)-e)-log(l/a) 
1 — a ■ log 



a J e + log(25(p) • e) + 3 log log(25(p) • e) 

31oglog(25(p) • e) - log(l/o) 



-a ■ e + 



e + log(25(p) • e) + 3 log log(25(p) • e) ' 



we actually need to prove that 3 log log(25(p) • e) — log(l/a) > 0. Substituting the value of a 
and simplifying, this reduces to 

e + log(25(p) -e) + 3 log log (25 (p) • e) < (log(25(p) • e)) 3 . (35) 

It is easy to verify that the function t 3 — t — 3 log t — e is monotone increasing in t for every 
t > 1.5, and for e < 1 its value for t = 1.5 is positive. Hence, since log(25(p)e) > 1.5, (f35j) 
follows. This completes the proof of Claim [131 an d thus of Theorem [71 □ 



Theorem S] follows immediately from Theorem [7J substituting in Equation (|27[) 5(p) = 1 for 
p = 1/2 and bounding e from above by 1. 



5 Tightness of Lemma [6] and Theorem [7] 

In this section we examine a variant of the "tribes" function presented in [BOL90] . We show 
that the assertions of Lemma [6] and Theorem [7J are essentially tight for this function. 

The tribes function over n coordinates with tribes of size r is defined as follows: we partition 
n coordinates into sets (tribes) of size r each, and let the tribes function assume the value 1 
if in at least one tribe all the coordinates are equal to 1, and otherwise. In order to make 
our function approximately balanced with respect to the biased measure we take / to be a 
tribes function with tribes of size 

log n — log log n + log log ( 1/p) 
log(l/p) 

This variant of the tribes function was first considered in |Tal94] . It is easy to see that for this 
choice of r we have E x ^ / _ tp [/(x)] ~ 1 — 1/e. In the following computations we use the symbol ~ 
to denote equality up to constant factors. 
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5.1 Tightness of Lemma [6] 

To simplify the computations, we add a restriction on the range of parameters we consider. We 
require that 

d < min(l/p, \/r, log n/ log log n). (36) 

We note that a wide range of combinations of the parameters satisfies this restriction. For 
example, if p = 1/logn, then (f36j) holds for all d < ydog nj (2 log log n) . 

Evaluating the right hand side of (|4j). For each x G {0, l} n , we have f(x) ^ f{x e«) if 
and only if in the tribe of i, all the coordinates of x other than Xi are ones, and in each of the 
other tribes, at least one of the coordinates is zero. Thus, the influences of / are: 

h{f) = Pr[/(s) + f{x et)] = - p^W*)- 1 « f- 1 = l °f" . 

Summing over the values of i, we get 

^-XHIEM' ^ . (37) 



i=l 



Since by Assumption ([36]) . d < \^y p ^ i we have 



OogCl/WC/)))"- 1 * ((l-^Mjlogn) 1 «(bgn) d - 1 . (38) 
V log n / 

It follows that for our function /, the right hand side of @ is approximately 

5e (2B{p)-e\ d - 1 (logn)^ 1 



d \ d— 1 / p(log(l/p)) 2 • n 

Finally, since by Equation Q, < 2p (i- P ) i g((i-p)/ P ) ' and since Dy Assumption ([36]), d < 

1/p, the right hand side of (j4]) is at most 

5e / e \ _1 (logn) d+1 / e\ d (logn) d+1 



d \{d— 1) -p{l — p)log((l — p)/p)J p(\og{l/p)) 2 ■ n \dJ p d ■ (log(l/p)) d+1 • n 

(40) 

Evaluating the left hand side of ([4]). We compute a lower bound on the l.h.s. of 

by considering only part of the cf-th level Fourier- Walsh coefficients of /. We compute the 
coefficients of the form /({ii, • • • , id}) where {ii,---,id} belong to the same tribe, the idea 
being that these are the dominant d-th level coefficients. We want to compute 

f({h,...,id}) = E[/ • U{ iu ...,i d }]. 

We divide {0, l} n into structures of 2 d values each according to the coordinates . . . ,id}- 

Note that a structure does not contribute to ElJ-w;^ if the value of / on all the elements of 

the structure is the same. Since . . . , id} are all in the same tribe, the only case in which / is 
not constant on a structure is when in the tribe containing . . . , i^}, all the other coordinates 
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are ones, and in each of the other tribes there is at least one zero element. For such structures, 
/ assumes the value 1 only when all the coordinates {ii,. . . ,id} are ones. Hence, 



nf-cj {il _ id} \ = (-x/(i-p)/p)V(i-/) (n/rM ~ (-i)V d/ v ~ (-i) d P - d/2 - 



log n 



log(l/p) -n 
(41) 

The number of d-th level coefficients of this type is ^Q)- Since by Assumption (|36p . we have 
d < Jr and d < l °^&n it follows that 

— v — logn ' 

n ( r\ ^ n (r — d) d nr d ~ l n(logn) d_1 



r\dj ~ r d\ d\ ^^(l/p))^ 1 ' 

Therefore, using Stirling's approximation d\ ~ \/27rd(d/e) d , we get the following lower bound 
on the left hand side of (jl]): 

V 1(0)2 > nQogn)^ 1 (logra) 2 ~( e Y {logn) d+1 

^ d J{ ' «d!(log(l/p))d-i > • (log(l/p)) 2 • n 2 ~ KdJ Vd.pd.(\og(l/p)) d +±-n { ' 

Comparing expressions ()40p and (|42p shows that the assertion of Lemma [6] is tight up to factor 
cy/d, which is small compared to the other terms in the expressions in both sides of Equation . 

5.2 Tightness of Theorem [7] 

In order to show that the assertion of Theorem [7] is tight up to a constant factor in the exponent, 
we have to prove that 

log5 e (/)«a(e)-e-log>V(/). (43) 

To simplify the computations, we assume that p < 1/logn, and in particular we have r < 1/p 
(we also deal separately with the uniform measure case below). 

We compute a lower bound on (log<S e (/)) by considering part of the r-th level Fourier- Walsh 
coefficients of /. By Formula (I41D . each of the coefficients f({ix,i2, ■ ■ ■ , v}) corresponding to a 
full tribe equals 

(-v(i-p)/p)V(i-P r ) (n/rM ~ (-i)v r/ v = (-i)v /2 . 

The number of coefficients of this form is n/r. Thus, 

v hsf > ^ « - ";°f re = e(i). 

~ r log 1/p -n-r 

\S\=r BV 1 ' 

Seif)> £ f(S) 2 (l-e) r >(l-eY, 

\S\=r 

log(S«(/)) § !2iilziiip. (44 ) 
log(l/p) 

On the other hand, using Formula (f38l) and the approximation log(2£>(p)e) log(l/p), we get 

^)-logW(/)«^. (45) 



Hence, by Claim [HJ 



and therefore, 
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Comparing Formulas (|44p and (|45p yields Formula (|43p . Moreover, it can be seen from the 
proof that the exponent in the assertion of Theorem [7] is tight up to factor 



(1 + (1))__1- 
-log(l 



which tends to 1 for small e. 



5.2.1 The Uniform Measure Case 

A similar computation shows the tightness up to constant factor in the exponent in the case 
p = 1/2 (namely, tightness of TheoremH]). This time we compute a lower bound on (log«S e (/)) 
by considering the Fourier- Walsh coefficients of / all of whose coordinates are contained in the 
same tribe. It is easy to check that for p = 1/2, all such coefficients are of order log 2 n/n. For 
each of the n/r tribes, there are 2 r coefficients that correspond to its subsets, and thus, 

y hs) 2 *-- ^.^2 = e(i). 

^-^ r n 

S contained in a tribe 

Hence, like in the previous case we get 

S e (f)> £ f(S) 2 (l-eT>(l-ey, 

S contained in a tribe 

and the rest of the proof is the same as in the previous case. 

We note that in this computation, the value of C in the exponent tends to 1 as e — >• 0. A 
refined computation can probably improve the value of the constant, but we won't be able to 
match the value C = .234 asserted in Theorem [H since it follows from Corollary 12 in [MO03 
that for the tribes function with p = 1/2, we have 

Se(f) £ W(/) log2(e)/2 = W(/)' 721 . 
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7 Appendix 

In this appendix we prove a decoupled variant of Lemma [5l stated as Theorem [16] below. This 
variant generalizes Theorem 2.4 of [Tal96], that was used by Talagrand to establish a lower 
bound on the correlation between monotone subsets of the discrete cube. While employing 
the same basic idea, our proof is shorter than Talagrand's proof, and applies also to a biased 
measure \i v on the discrete cube. For the sake of generality, we present the proof in the biased 
case, thus providing a decoupled variant of Lemma El As in the proof of Lemma [6j we use 
throughout the appendix the normalized variant of the influences: lUf) = \/p(l — p)h(f)- 

We start with a generalization of Lemma 3.1 in [Tal96]. The proof is a slight modification of 
the proof of Lemma El 
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Lemma 14. Let f : {0, l} n — > {0, 1} be a function, and let {I, J} be a partition of {1, . . . , n}. 
For t > 0, denote 

L{ = {j€J: £ /(T,j) 2 >t./;.(/) 2 }. 



TCI 

\T\=m-l 



Then for all t > 4 • (45 (p) • e) d_1 , 



E < 5e • (V4)"^ • B(p) • exp (-^^ • (t/^V^Y (46) 

Proof: Exactly the same argument as used in the proof of Lemma [6] (see Equation [22]) shows 
that for all t > (4B(p) • e) {d " 1)/2 , 

E E f( T J) 2 < 2 4- E ^^) 2+10e '*r^ - ex p (-^^^ -*o /(d-1) )- ( 47 ) 



^ ei * |T|=d-l jeLt 



On the other hand, by the definition of L{ we have 



E E /W>E^'(/) 2 = i -EW) 2 («) 



Taking to = and combining Inequalities ((371) and (|38j) . we get 
i • E W ^ (*/2) • E + 10e • • B(P) • exp (-^=^ 



■ 0/4) 



(49) 

and simplification yields the assertion. □ 

We now present a decoupled variant of Lemma [T31 The proof is a series of applications of the 
Cauchy-Schwarz inequality. 

Lemma 15. Let f,g : {0, l} n — > {0, 1} 6e functions, such that 

n n 

E?(/) J <i. E J ^) 2 <i> (so) 

and Zei {J, J} 6e a partition of {1, ... ,n}. For t > 0, denote 

L t = {j€J: E f(TJ)9(.TJ)>t-l' j (f)l' j (g)}. 



TCI 

\T\=d-l 



Then for all t > 4 • (4J3(p) • e) d ~ l , 



E Wife) < 2 - (Se) 1 / 2 • (t/4)-^T . (B(p))V2 - exp ("^^ ■ (i/4) 1 /^ 1 )). (51) 



Proof: Define the sets l{ and l? t as in Lemma [T4l above. By the Cauchy-Schwarz inequality 
and Assumption ([50]) . 

1/2 





r V /2 


( Y /2 




E ») * 


E ^(/) 2 




E wA 






V 6 ^ / 





and thus by Lemma [T 

E I'jWfa) < ( 5e ) V2 • ■ (B(p)) 1 / 2 ■ exp 

3&4 



(d-1) 
4B(p) ■ e 



(t/4) 



i/(d-i) 



The same holds for YljeL" Ijif)Ij(9)- Now we note that by the Cauchy-Schwarz inequality, 
Lf C L t U , and thus, 

E * E WW + E mm 



jeLt 



as asserted. □ 



< 2 • {he) 1 ' 2 ■ (t/4)" • (J?(p))Va . exp (-^j^ ■ (t/4) 



Now we are ready to present the decoupled version of Lemma 

Theorem 16. For all d>2, and for any functions f,g : {0, l} n -4 {0, 1} such that: 

n n n 

EW<1- EW<1, ™* £3(/)J$(0)<exp(-2(d-l)), (52) 

3=1 j'=i i=i 



E f^9( S ) 
\S\=d 



< 



70e (AB{p)-e 



d \ d-1 



d-1 I n 



d 



d-l 



(53) 



Proof: As in the proof of Lemma[6j we first consider a partition {/, J} of {1, . . . , n}, and prove 
that 



E E /(T,,»(T, J )<70.(^M_£)"iE^ 

jeJ tci v y Vie J 

|T|=d-l V 



(f)i'(g) 



(54) 



d-1 



log 



We apply Lemma [TTI with = J endowed with the uniform measure, and the functions 

T,Tcl,\T\=d-if(. T >j)9(TJ) 



h(j) = r j (f)i' j (g), and -fcC?) 
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Noting that the set L(t) = {j : /2C7) > t} is exactly the set L t defined in Lemma [T5l we get 

E E /Wj^f & ( 55 ) 

t 7 1 > / 



|T|=d-l 

\<f-l 



Therefore, using Lemma [T5l we have, for all t% > 4 • (4B(p) ■ e) 

m = J2 ^£ Wwj^+J^ fE J K/)^^)V^*iE^(/) J K5)+ 

+ f°° (2- {be) 1 ' 2 • (t/4)-^) • (£(p)) 1/2 • exp f- ~ 1} • (t/^V^-D^rft. 



t=ti 



4B(p) • e 



We evaluate the integral in the same way as the integral in Lemma fT2| and obtain that for all 
h >4,-(8B(p)-e) d -\ 



m < tiEWW + 18 - 74 • ( B (P)) 1/2 ■ (16B(p)e) • • exp (- 



4B(p) • e 

(56) 

We then choose ti such that 

(Note that due to Assumption (l52j) . we have ii > 4 • (8B(p) ■ e) d ~ l , as required). Substituting 
t\ into Inequality (|56|) . we get 



m < 70. ( ^V _1 f Eww] ( log E^(7^p^ ' (57) 



proving (j54j) . Finally, the derivation of (|53j) from (|54|) is exactly the same as the last step in 
the proof of Lemma [6l □ 
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